o 

(N 

< 

(N 



i> 






CONCENTRATION FOR MULTIDIMENSIONAL DIFFUSIONS 
AND THEIR BOUNDARY LOCAL TIMES 

SOUMIK PAL 



Abstract. We prove that probability laws of certain multidimensional semi- 
martingales which includes time-inhomogenous diffusions, under suitable as- 
sumptions, satisfy Quadratic Transportation Cost Inequality under the uni- 
form metric. From this we derive concentration properties of Lipschitz func- 
tions of process paths that depend on the entire history. In particular, we 
estimate concentration of boundary local time of reflected Brownian motions 
on a polyhedral domain. We work out explicit applications of consequences of 
measure concentration for the case of Brownian motion with rank-based drifts. 



1. Introduction 



Consider the sample space {n,J^) where fi is a metric space and J^ is the as- 
sociated Borel cr-algebra. We say that a probabihty measure fi on {^,J^) has the 
measure concentration property if the following statement holds. For any set A € T 
CO ' such that fJ.{A) > 1/2, one has ii{Ar) very close to one. Here fi{Ar) refers to the 

^ I set of all points that are at a distance no larger than r from A. The closeness is 

typically expressed as a gaussian tail estimate in r. Moreover, for fixed r, the prob- 
fvj , ability ii{Ar) tends to one exponentially fast in the dimension of the underlying 

fSJ ' space. Concentration inequalities and their applications have become an integral 

part of modern probability theory. See, for example, the seminal articles by Tala- 
(^ • grand [331 Hll Sil HSl HZ] • An excellent account can be found in the monograph [53] 

(^ [ by Ledoux to which we refer the reader for a survey of the (pre-2001) literature. 

Throughout this article our sample space is going to be some subset of the space 
of continuous function on [0, oo) denoted by C[0, oo) and products of such spaces. 
A sample path is denoted hy uj{t), < t < oo. The filtration will be the natu- 
. , ral filtration made right-continuous and suitably completed under the probability 

j^ ■ measures we consider. The metric on the sample space will be mostly given by the 

uniform metric: d{uj,uj') = sup^ \oj{t) — cj'(t)|. 

The probability measures we consider on the above sample space are the laws of 
multidimensional semimartingales including diffusions. The reason for considering 
diffusion laws on the path space has some strong motivation from applications. 
For example, consider the boundary local time of a reflected Brownian motion in 
an infinite wedge. Despite its significance in several areas of probability including 
queueing theory and mathematical finance (see, for example, the excellent survey by 
Fernholz and Karatzas [T7| for applications to Stochastic Portfolio Theory), very 
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2 SOUMIK PAL 

little is known about such local times. The challenge is the fact that such local 
times are functions of the entire history of the path (as opposed to being functions 
of the one dimensional marginals). We show how to estimate the fluctuation of 
such local times and explicitly compute the case of Brownian motion with constant 
drift reflected in the orthant {x G K" : Xi > a;2 > . • . > Xn}- Two points 
arc worth drawing attention to: reflected Brownian motion is a highly dependent 
system of processes; and, unlike a typical function concentration result, local times 
are not Lipschitz functions of the reflected process under the uniform norm. To the 
best of our knowledge there is no other way established in the literature to obtain 
concentration estimates of local times. 

The other compelling reason comes from mathematical finance. Given a financial 
market with a large (typically hundreds or thousands) number of stocks, one hedges 
risk by constructing a diversified portfolio. Informally, this amounts to distributing 
the capital in holding and rebalancing shares over the entire equity market so that 
the value of the resulting portfolio is impervious to market risk. It is now clear that 
in mathematical terms this amounts to a concentration of the value process around 
a certain deterministic path. In Section 13.31 we show through examples how such 
conclusions might be reached from our results on measure concentration. 

Before we proceed, let us add two caveats. One, we cannot compute expectations 
from concentration of measures, which require other methods. Two, because of the 
highly dependent structure in some of our examples our concentration bounds are 
not always Gaussian nor dimension independent. Gaussian tail estimate itself is 
a delicate property not shared by all stochastic processes. For example, consider 
the Bessel-square processes (see Revuz and Yor [40]). The marginal distribution of 
these diffusions are Chi-squares, which do not have sub-Gaussian tails. Thus, one 
cannot expect a Gaussian measure concentration property to hold for all Lipschitz 
functions of such processes. 

Our proofs depend on an original observation due to K. Marton [30l [311 132] : 
Concentration of measure is a consequence of what are known as Transportation 
Cost Inequalities (TCI). We explain this wonderful method in Section [2.11 In fact 
what we prove in the text is, under suitable assumptions, multidimensional diffu- 
sion measures satisfy Quadratic Transportation Cost Inequality (QTCI) (see also, 
Talagrand [47|, Dembo [8], Dembo and Zeitouni ^). Several other recent articles 
have taken a similar approach for proving concentration estimates for diffusions 
although ours is the first proof of QTCI for diffusions and other semimartingales 
w.r.t. the uniform metric. QTCI is unique in its advantages and is related to the 
log-Sobolev inequality, hypercontractivity, Poincare inequality, inf-convolution, and 
Hamilton- Jacobi equations. For details, please consult Otto and Vilani [36], Bobkov 
and Gotze [5], and Bobkov, Gcntil, and Ledoux [4]. Also see the recent articles by 
Gozlan [SO] [21] and by Gozlan et al. [22] which shows equivalence of QTCI with 
a restricted logarithmic Sobolev inequalities. The last article also gives the first 
proof that QTCI is preserved under bounded perturbation. 

Let us provide a brief review of literature of measure concentration in path 
space. Houdre and Privault [24] and Nourdin and Viens [35] use tools from Malli- 
avin Calculus to derive concentration inequalities for functionals on the Wiener 
space among other things. Proving TCI on the Wiener space using Girsanov the- 
orem, as we have done, first appeared in Feyel and Ustunel [18]. Djellout, Guillin, 
and Wu [10] provide characterization for L^-TCI for diffusions. They also prove 
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QTCI for diffusfons with respect to the Cameron-Martin L^-metric. Several articles 
in analysis and geometry are also devoted to this topic. Fang and Shao [HI [15] 
consider TCI on abstract path spaces on connected Lie groups, Fang, Wang, and 
Wu [16| consider TCI w.r.t. the uniform metric for special diffusions on a complete 
Riemannian manifold; Gourcy and Wu |19| considers log-Sobolev inequalities un- 
der the L^-metric; Wang [50l[5T], studies generalized TCI on complete connected 
Riemannian manifolds; and Wu and Zhang j52j prove QTCI for the uniform metric 
under an L^-contraction property of the diffusion semigroup. 

The outline of the article is as follows. In the following section we explain the 
connection between TCI and measure concentration. In Section [3] we prove QTCI 
for semimartingale strong solutions of SDE's under suitable assumptions in one 
dimension. By using tensorization properties of QTCI, in Theorem [6] we extend 
concentration results to multidimensional processes with independent coordinates. 
However, independent coordinates are of limited appeal in applications. In Theo- 
rem [7] we use a perturbation argument to prove measure concentration properties 
for dependent processes that arc locally absolutely continuous with respect to some 
multidimensional process with independent coordinates. The strenth of the con- 
centration depends on a Birnbaum-Orlicz norm of the Radon-Nikodym derivative 
of one measure with respect to the other. In Section 13.31 we discuss concentra- 
tion of Lipschitz functions of diffusions. This includes concentration of regular and 
stochastic integrals with respect to diffusions. 

Our main example is worked out in Section |4] where we work out concentra- 
tion estimates of local times for Brownian motion interacting through their ranks. 
These non-trivial processes can be described as follows. Let 61,62, ■■■ ^Sn he n real 
constants. Consider the following system of stochastic differential equations: 

n 

(1) dX,{t)^Y,6jl{X,{t)^X^j-){t))dt + dW^{t), i = l,2,...,n. 

Here X(i)(<) > X(2)(t) > ■ ■ ■ > X(„)(<) are the coordinates of the process in the 
decreasing order, and W = {Wi , W2 , ■ ■ ■ , Wn ) is an 71-dimensional Brownian motion. 
The SDE models the movement of n particles as interacting Brownian motions such 
that at every time point, if we order the positions of the particles, then the ith 
ranked particle from the top gets a drift 6i ior i = 1,2, ... , n. As time evolves, the 
Brownian motions switch ranks and drifts, and hence their motion is determined by 
such time dependent interactions. Such processes have been considered in several 
recent articles. Among the more recent ones, see Banner, Fernholz, and Karatzas 
[2], Banner and Ghomrasni [5], McKean and Shepp [33], Pal and Pitman [37], 
Jourdain and Malrieu [27], Chatterjee and Pal [6] [7], Ichiba and Karatzas [25] . 
Ichiba et al. [26], and Shkolnikov [42]. We refer the reader to the above articles for 
the list of applications of such models. They are similar in the discrete setting to the 
dynamic models of spin glasses studied by Arguin and Aizenman [1] , Ruzmaikina 
and Aizenman yiOj, Shkolnikov [4T] . 

As an example of a typical result one can derive from this theory, let us state a 
theorem we prove in Section |4j 

Theorem 1. Consider the model described in ([1]). Let Ljj+i{T) denote the local 
time at zero for the semimartingale {Xij\ — Xij^i\) up to time T . For any choice of 
parameters {5i, . . . , 6n) and constant initial points for the coordinate processes the 
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random variable x = maxi<j<„_i Ljj+i{T) satisfies the following tail estimate: 
^(|x-™xl>^"'^') <2exp|-^|, r> 2^/2 log 2. 

Here m^ is the median of x o-nd C refers to a universal constant. 

An improvement on this bound and other apphcations related to rank-based 
models have been done in the follow-up article by Pal and Shkolnikov [35] . Some of 
our results about TCI for multidimensional diffusions have been recently generalized 
in an article by Ustiinel [15] . 

2. Preliminaries 

2.1. Transportation cost and concentration. Suppose {X,d) is a complete 
separable metric space equipped with the Borel sigma-algebra. For all probability 
measures P and Q on this probability space, consider the p-th Wasserstein distance 

WpiP,Q)=mi[EdiX,X'ff'', 

TV 

where the infimum is over all couplings of a pair of random elements (X, X') such 
that the marginal law of X is P and that of X' is Q. 

Now, we fix a particular probability P. Suppose there is a constant C > such 
that for all probability measures Q <^ P we have 



(2) Wp(P, Q) < ^2CH{Q\P). 

Here H refers to the relative entropy H {Q \ P) = E^ log [dQ/dP). Then we say 
that P satisfies the \J' Transportation Cost Inequality with the constant C. When 
p = 2, this is often called a Quadratic Transportation Cost Inequality (QTCI). 

A function / : A" — > M will be called Lipschitz if there is a positive constant a 
for which 

\f{x)- f{y)\<ad{x,y), x,yeX. 

The constant a is then referred to as the Lipschitz constant. We shall call a function 
to be 1-Lipschitz if a can be taken to be one. Let C denote the set of all 1-Lipschitz 
functions on {X, d). The (very short) proof of the following theorem can be found 
in Lcdoux |29l p. 118] and the original article by Marton |30] . 

Theorem 2. Suppose that P satisfies QTCI with constant C. Then, one has the 
following concentration estimate for all r > 2\/2Clog 2: 

(i) For any measurable set A such that fJ-{A) > 1/2, one gets 

(3) fi{Ar)>l-cxp{-r^/8C}, Ar = {x e X : dix,A) < r} . 
(ii) And for any / £ £, one has 

(4) P {x : |/(x) - m/l > r) < 2e-'''/«^, 
where nif is the median of f . 

In general, any Wasserstein metric can be used to show ([2]), however the choice 
of J) = 2 is important due to its tensorization property we describe below. For the 
proof sec Lcdoux [29l p. 122-123] (although our statement appears slightly different 
from Ledoux's monograph, they can be easily seen to be equivalent). 
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Theorem 3. Suppose {Xi, di, Pi), i ~ 1,2, ... ,n be n polish spaces with correspond- 
ing distances and probability measures on them. Consider the product metric space 
A"" = Xi X ... X Xn with the distance 



d{x,y) = 



\ 



1 " 

Tt ^ ^ 



n 

i=l 



and the product probability measure on it P = ®Pi . Suppose that every Pi satisfies 
the QTCI with the same constant C , then P satisfies QTCI with constant C jn. 

Finally, wc need the following lemma which will be useful in the later text. Its 
(short) proof can be found in p^ . 

Lemma 4. [Lemma 2.1 in [10) / Suppose fi is a measure on a metric space (E, dE) 
that satisfies TCI with respect to the Wp norm with a constant C. Let {F,dp) be 
another metric space. If the map ^ : {E,dE) — > {F,dp) is Lipschitz, i.e., 

dF{'^{x),-^{y))<adE{x,y), for all x,y e E, 

then jj, ^ i^io v]/"! satisfies TCI with the Wp norm with a constant Co? on (F, dp). 

3. Concentration of diffusion laws 

Fix a finite positive time horizon T . Consider the metric space given by A" = 
C[0,T], along with the norm 

(5) d{u,Lo') = \\Lo-Lo'\\^{T)= sup \L0{t)-J{t)\, LO€C[Q,T]. 

0<t<T 

Also consider the product space X^ = C"[0,T]. Let u! ~ {uti, . . . ,aj„) and uj' = 
{uj'i, . . . ,uj'^) be two elements in X". Then 



(6) d{ijj,ijj') 



1 " 

-V sup \uj,{t) - uj[{t)\^ = 

\^i-{^<t<T \ 



^Eik--^iiL(r). 



n 

i=l 



3.1. Concentration for a fixed time horizon. Consider a progressively mea- 
surable drift function h{s,uj) from [0, oo) x C[0,oo) into R. That is to say, b{s, •) 
depends on the entire history of the process until time s. In particular, the pro- 
cesses we consider below need not be Markov. However, we do assume that the 
diffusion parameter a{t,X) depends only t and X{t). That is, ct is a function from 
[0,oo) X R into R. The following result is under the above set-up. 

Theorem 5. Fix a time point T > 0. Suppose X is a strong solution to the 
equation 

(7) dX{t):^b{t,X)dt + (j{t,X{t))dW{t), 0<t<T, X(0) = x(0), 
where the coefficients satisfy the global Lipschitz conditions 

\b{t,uj)~bit,Lo')\<Ki sup \uj{s)~uj'is)\=Ki\\uj-ij'\\^it) 

(8) 0^^^* 

and \c7{t,x) -a{t,y)\< K2\x~y\, 0<t<T, x,y£R. 

Additionally assume that diffusion coefficient is bounded, i.e., < a(t,x) < k. Here 
Ki , K2 , K are positive constants. 
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Let P denote the law of X considered as a probability measure on the metric 
space (C[0,r], ||-||oo). Then P satisfies the quadratic transportation cost inequality 
^ for the choice of 

(9) C = 4A.2Te4^(^i'^+4^='). 

Proof. The set-up of the proof is very similar to the proof of Theorem 5.6 in |10) . 
So we can conveniently skip some details. Consider the canonical sample space 
n = C[0,T] with a universal Brownian filtration {J-t-, < t < T}. The details 
of this construction can be found in Revuz and Yor [40l Sec. 2 Chap. 3]. This 
amounts to a filtration that is generated by the coordinate process and suitably 
made right-continuous and augmented with the null sets that are common to to the 
family of Wiener measures starting from any initial probability distribution. 

We consider the standard Wiener measure on this space, and thus the coordinate 
process, W{t), < t < T, is distributed as a standard Brownian motion. Since 
X is a strong solution to the SDE ([7]), one can construct a copy of the process 
X adapted to the above probability space that satisfies ([7]) with respect to the 
coordinate Brownian motion W. Let P denote the law of the process X on C[0, T]. 

Let Q be any probability measure absolutely continuous with respect to P. Thus, 
if we define M{T) to be the Radon-Nikodym derivative of Q with respect to P it is 
a function of X . In particular, M{T) is a measurable function with respect to the 
Brownian filtration constructed above. We now construct a martingale by defining 

(10) M(t) = E {M{T) I J"t) , < f < T. 

Then M{t) is a martingale with respect to the Brownian filtration. It follows then, 
see Revuz and Yor [40l Sec. 3 Chap. 5], that this martingale has a continuous 
version which can be written as a stochastic integral 



(11) 



M{t) = 1+1 H{s)dW{s), < t < T, 
Jo 



for some predictable process H. 

It suffices for QTCI to consider Q such that the Radon-Nikodym derivative 
is square-integrablc under P. We will assume that the martingale M is square- 
integrable for simplicity. 

Now, since M{t) is a continuous martingale, we use Girsanov's Theorem [40j 
p. 327]. Thus, under the measure Q, the process X satisfies the SDE 

dX{t) = b{t, X)dt + cr(i, X{t)) [dB{t) + M{t)-^H{t)dt\ 

= [b{t, X) + cr(t, X[t))M[t)~-^H{t)\ dt + a{t, X{t))dB{t). 

Here Bit) is a Brownian motion under Q. Note that M is never zero under Q and 
hence M~^ makes perfect sense. 

Now to bound the Wasserstein distance between the two measures P and Q, we 
need to couple the solutions of the two SDK's ([7]) and (fT2|) . To do this, we construct 
a solution of p2p on a filtered probability space with the Brownian motion B{t) 
running during time [0,r], and use this same Brownian motion to create a strong 
solution of ([7]). 
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Thus we have a coupled process: 

(•t ,- 



X«(t) 



Xq 



5(.s,X(i)) + ct(s,X(i)(s))M(s)^1H(s) 



ds - 



a{s,X^^\s))dB{s), 



X^^\t)^xa+ bis,X'^^^)ds+ a{s,X'^^\s))dB{s) 



Wc now estimate the uniform distance between these two processes. 

Define a nondecreasing sequence of functions ipnix) which serve as an smooth 
approximation to the function |2;| while satisfying 



(13) 



K(x)|<l, and 0<^':ix)< 



nA'oX^ 



The details on how such a sequence can be constructed can be found in [281 P- 291]. 
Now define 

A(i) := X(^\t) - X(2)(t) = / [6(s,X(i)) - bis^X'^^l) + a{s,X'^^\s))M{s)-^H{s) 



ds 



cr(.s,X(i)(.s)) - a(.s,X(2)(s))j dB{s). 
Jo ■- -* 

Then by Ito's rule 



ds 



(14) 



+ / i,'^{^{s))a{s,X(^\s))M{sr^H{s)ds 



+ - / V^;'(A(.)) a(.,xW(s))-a(5,X(2)(,)) 



ds 



^;(A(s)) k5,X«(s)) - a{s,X(^\s))\ dB{s). 



Now, by the condition ([S]) on cr and the property of the function -0" in ([T 



get 



(15) 



0<- / i,:[{^{s))\a{s,X^^\s))^a{s,X(^\s)) 



ds < -. 
n 



By using the Lipschitz property of the drift function 6 and (|13p we obtain 



(16) 



^;(A(5))ks,xW(s))-6(s,x(2)(,)) 



ds 



< Ki / sup |A(u)|ds. 

'o 0<u<s 



Let £,{s) be the process AI{s) ^H{s). Since a and -0^ are bounded functions, we 
get 



(17) 



^'^{Ais))a{s,X^'Hs))Mis)-'H{s)ds 



<K / \as)\ds. 
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The final stochastic integral above in ([T4| is a local martingale. We use Doob's L^ 
inequality [40l p. 54] to claim the following: 

(18) 



E sup 

0<s<t 



v;(a(k)) k^, x(«)(i)) - aK x(2)(«)) 



dB{u] 



<AE (C(AGs)))'ks,X(s)«)-a(s,X(2)(s))l ds < AKIE A\s)ds. 



The final inequality is due to the Lipschitz property of the coefficient a and ([T3|) . 

Combining the inequalities (fT4|) . (fT5|) . (fTBI) . (fTT]) . (fT8| . and applying Cauchy- 
Schwarz we get 

-E sup ^^(A(s)) < if^S { [ sup |A(7.)| ds) 

+ K^E(f \^{s)\ds\ +^ + 4K^E f A^s)ds 
(19) <Kft f E sup \A{u)fds + K^E(f \^{s)\ds 

Jo 0<u<s \Jo 

4-2 ft 

+ — + 4Ki / E sup \A{u)\^ds 
n Jo o<u<s 

< an + {KfT + AKi) E sup \A{u)f ds, 

Jo 0<u<s 

where the constant a„ is given by (again applying Cauchy-Schwarz) 

/■^ 
a„ = K^TE / S.^{s)ds + T^/n^. 



Now, one can construct ipn such that ipn{x) converges to \x\ uniformly on com- 
pact sets. Recalling that the sample paths are continuous and taking the limit as 
n goes to infinity in ()19p . we obtain 

E sup A'^{t) <4a+4:{KfT+4K^) f E sup \A{u)f ds, a = K^TE j £,'^{s)ds. 

0<s<t Jo 0<u<s Jo 

Let (j>{t) denote the quantity E'supQ^^^^ A^(f). Then it follows from above that 

(j){t) < 4a + i{KfT + AKi) f (t){s)ds. 

Jo 

By an application of Gronwall's lemma [40l Sec. 1, Appendix], we obtain (j){t) < 
^ae'"^KlT+AKl)t ^ Thus we get 

E sup A(t)2<4K2j.g4T(K?T+4if,^)^ /'^^2(^)^^^ 
0<t<T Jo 

Recall that we are using the uniform distance d{uj,uj') = supg<j<'p |w(i) — ^'{t)\ 
between two paths uj,uj' in C[0,T]. Thus, from our calculations above, we get 

(20) Ed^X^^\x'^^^) < 4A.2Te4(^?^+4Ki)^iJ [^ eis)ds. 

Jo 



CONCENTRATION OF DIFFUSIONS 9 

On the other hand (see [101 eqn. 5.7]), the entropy of Q with respect P is given 

by 

(21) H{Q\P) = ^EQJ aufdu. 

By combining the above inequahty with ()20|) wc get 

i?d2(x(i),x(2)) < 8.c2re4^(^'i^+4^")ff (Q I P) , 

which completes the proof. 

The final claim follows by the tensorization argument outlined in the previous 
section. D 

Theorem 6. For each i — 1,2, ... ,n, let bi(t, uj) be progressively measurable real- 
valued drift function on [0, oo) x C[0, oo) and Uiit, x) be measurable functions from 
[0,c») X R into R. Consider the following system of multidimensional stochastic 
differential equation: 

dX^{t)^b^{t,X^)dt + a^{t,X^{t))dW^{t), 0<t<T, X^{0)^X^. 

Assume that there are positive constants K and k such that for every i = 1,2, . . . ,n, 
the coefficients satisfy 

max \bi{t,uj)~bi{t,uj')\ < K \\uj - lo' \\ ^ (t) , uj,uj' e C[0,oo) 

l<i<n 

(22) .^.Ji'iax \cTi{t,x)-ai{t,y)\<K\x-y\, x,yeR 
and < min ai{t,x) < max cri{t,x) < k. 

l<i<n l<i<n 

Let P denote the product law of independent processes {Xi, . . . ,Xn). Then P, 
seen as a probability measure on the metric space (C"[0, 1], d), as in ^, satisfies 
the QTCI © for the choice of C ^ 4n~^K^Te^^'^^'^+'^'> . 

In applications however the multidimensional diffusions with independent coor- 
dinates are of limited use, although it is common to use dependent diffusions that 
are a Girsanov change of measure of independent ones. If this change of measure 
is not too drastic, one should expect concentration properties to transfer to the 
dependent diffusion case. The following perturbation result makes this precise. 

Wc prove the result for a general metric space and any TCI. However, for our 
purpose in this paper the metric space will be the path space and the metric will 
be the uniform metric. 

Theorem 7. Suppose P and R are mutually absolutely continuous probability mea- 
sures on a complete separable metric space {X,d). Let L be the logarithm of the 
Radon-Nikodym derivative process of R with respect to P. Suppose that P satisfies 
LP TCI (p > 1) with a constant C. Consider any A such that R{A) > 1/2. For all 
r such that 



r>2^2Clog2 + 4C||i||i, 
one has 

(23) 1 - R{Ar) < exp 



8C(1+4||L||<,) 
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Here, \\L\\-^ is the expectation w.r.t. R, and \\L\\^ is the Birnhaum-Orlicz norm 
(w.r.t. R) defined by 

(24) i|L||<j, :=inf {a>0: £:^$(|L|/a) < l}, $(i) = e*-t-l. 

Proof. Let A be a measurable subset and let Ar be as described in Q. For any 
measurable subset B, let vg denote the probability measure R, conditioned on B, 
i.e., 

R{Bn-) 

The measure vb is clearly dominated by R, and hence by P, due to the assumed 
mutual absolute continuity. 

Consider the Wasserstein distance between va and i^b, where B is the com- 
plement of Ar- By the triangle inequality and the fact that P satisfies TCI, we 
get 



(25) Wp{,yA,'yB) <Wp{iyA,P) + Wp{i^B,P) < V^CH {va \ P) + ^2CH (vb \ P). 

Now, since d(j4,i3) > r, for any coupling between X '-^ va and Y '^ vb, it 
follows immediately that d{X, Y) > r. Thus, it follows that the left side of the 
above inequality is at least as large as r. We estimate the right side below. 

Obviously 

H(„,|P)....(,„,^)=...(,„,^)....(,„,f 

In other words 

(26) H{ua\P)<H {va I R) + max ( 0, E"-'^ ( lo; ^ 



dP 



Now, by assumption, log dR/dP = L. Thus 



(27) i.- (log §) ^ ^^E- iHAW < ^^E- iliA}L) . 
Select A such that R{A) > 1/2. Then, we get 

(28) j^^E^ il{A}L) < (RiA))-' E^L = 2\\L\\,. 

For the set i?, we follow a similar line of argument except for the final estimate 
above. We use a pair of Young's function (i.e., convex conjugates, see Neveu [Ml 
Appendix, p. 210-213]) $ and * given by 

$(i) = e* - i - 1, *(«) = (1 + v) log(l + v)-v. 

Recall the definition of a Birnbaum-Orlicz norm for suitable random variables: 

ll^ll^ := inf {a > : S^$(|X| /a) < 1} , ||y||^ := inf {a > : £;^*(|y| /a) < 1} . 

We will use the following generalization of the Holder's inequality for Young func- 
tions (see [Ml Appendix, p. 210-213]): 

-^£;« il{B}L) < 2 IILIU, \\1{B}/R{B)U ■ 
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Let us estimate \\1{B}/R{B)\\^. For any a > 0, we get 



^«^ 



(29) 



aR{B) 



E 



RiB) I 1 + — ^) log fl ^ 



aR{B)J °V ai?(B) 
Note that i?(B) < 1/2 since R{A) > 1/2. We claim that 

(30) \\l{B}/R{B)\\^<a:= 



HB} ] 
aR{B) J 
1 
a 



1 

R{W) 



l-R(B)logj^ 

It suffices to check that for this value of a, the expression in (p9|) is smaller than 
one. To see this, note that, by our definition 

1 1 



1 



Thus 



HB} 



aR{B) 



R{B)[l 
1 



aR{B) R{B)\ogl/R{B) 

1 \. A 1 

1 



aR{B)J '^^y^ aR{B) 



log ■ 



1 



log(l/i?(i3)) ^ R{B)\ogl/R{B) \ogllR{B) 
1.1 1 



1 



■ log 



+ i?(B) 
R{B) 



log(l/i?(B)) ^logl/i?(S) \ogl/ R{B) 

Let 1/x := — log i?(i?). Our claim will follow if we show the following to be 
negative: 

1.1 1 



h{x) = X log X — X + e "^'^ 



■log- 



R{B). 



\og{l/ R{B)) °\ogl/RiB) logl/RiB) 

Since < R{B) < 1/2 it is enough to check in the interval (0, l/log2). We claim 
that in this interval the function h is convex. To verify, note that 



h'{x) = logx + x'-^e-V^, h"{x) 
Noting that e^/^ > 1 + l/x, we get 



1 



-l/x 



1 



-l/x 



=. -r-4p-l/:^ 



h"{x) = x-^e 



= x-^e-^l'- 



c^e^/^ - 2.T + 1 > x-^e-^l"" [a;^ + x^ - 2a: + l] 

{x^ + (x - \f\ > 0. 

This shows that h is convex. Thus, to show check for the negative sign of h 
it is enough to check at the end points. Plainly h{0+) = 0, and numerically 
/i(l/log2) w —0.41 < 0. By convexity it now follows that that h{x) is negative for 
a: g (0,1/ log 2). 

This proves the claim pop . In fact, we will simplify our choice of a slightly more 
by defining 

a = 2 log — - — - , 

which is larger than the choice in ([50]) since R{B) < 1/2. 
Combining our argument so far we obtain 



(31) 



1 
R{B) 



E^ (1{B}L) < 4\\LU\og 



1 

R(B)' 
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Thus, combining the above with ^, dMI, dUl), dSH]) we get 



"■ < Jh {va \R) + 2\\L\\^ + Jh {vb I i?) + 4 ilLJI^ log ■ ^ 



Note that 

iJ(zy^ |i?)=log-^<log2, H{uB\R) = \og- "^ 



R{A) - ->= ' V - I V -^e, ^^^^ ■ 

Thus 



"■ <w'log2 + 2l|L||, + J(l + 4l|L||^)log. ^ 



V2C - V => ^ ' y^ " "*' °R{B)- 

Note that i?(-B) = 1 — i?(A,.). Hence for all r larger than 



2iy2Clog2 + 4C||L|l„ 
one has (say) 

2C(l + 4||L|U)log^ ^> 



1 - R{Ar) 



Or, by rearranging terms, we get 

1- R{Ar) < exp( -- 



r' 



8C{1+4\\L\\^)^ 
This proves the assertion. D 




Remark. To get a feeling for the Birnbaum-Orlicz norm ||-||^ used in p4p . let us 
compute this norm for the case when L(T) = B{T), a one-dimensional standard 
Brownian motion. For any a > 0, a quick calculation will show 

E (e-"l^mi) - -E \B{T)\ - 1 = 2e^/2a^<i> (VT/a) - 

If we take a = -v/T, the above expression reduces to 2y/e'^{\) — \p^^^ — 1 which 
comes to about 0.976. In other words \\B{T)\^ « a/T. 

Finally note that so far we have assumed that the starting points of the processes 
are given constants. When we randomize the starting values, it is not obvious 
what happens to the Transportation Cost Inequality of the mixture. This general 
problem is studied under the rubric of dependent tensorization and TCI (specialized 
to Markov chains of size two). See |3^ and [TD] for more details, in particular 
Marton's coupling for Markov chains. 

However, if we are interested in only concentration of measures, certain bounds 
can be easily obtained. 

Lemma 8. Suppose fi is a probability measure on a metric space {E^cIe)- Let 
{F^dp) be another metric space. Suppose there is a regular conditional probability 
Px, X g £■, which is a probability measure on the Borel a-algebra of F. Assume 
that each P^ in the support of fi satisfies QTCI with a constant C . 

For any Lipschitz function f : F ^ M., let mf{x) denote the median of f with 
respect to the probability measure Px. Let m denote any constant. Then, if P 
denotes the randomized measure J Px{-)fJ,{dx), one gets 

P{\F-m\>r) <2exp I -^^\+fi{x: |m/(x) - m| > r/2) , r> 2^2 log 2. 
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Proof. Follows from the triangle inequality. D 

To specialize the above result to our set-up at hand, take {E, (Ie) to be R" under 
the Euclidean norm and [F^dp) to be C"[0,r] under the d norm. One can take 
m to be either the expectation or the median of the numbers mfix). Much better 
bounds can be obtained if we know that the Markov semigroup of the diffusion has 
some contraction properties. See the analysis by Wu & Zhang [52] in this direction. 

3.2. Concentration for infinite time horizon. For any two paths wi and cj2 in 
C[0, oo), we denote the uniform metric on their restriction to [0,n] by d„(aji, W2) = 
supo<t<„ {\ijJi{t) — a;2(i)|). Then consider the locally uniform metric 

.o^^ ( ^ c„d„(a;i,a;2) 

{61) /9(aji, aj2) = rnax ■ 



1 + rf„(wi,W2) 

for some sequence of positive numbers c„ (to be specified later) such that lim„_^oo Cn = 
0. It is well-known that this metric makes the space C[0,oo) a complete separable 
metric space. 

The concentration results of the last subsection can all be extended to this case, 
although presumably it is less useful since it is more difficult to check Lipschitz- 
ness of functions with respect to the local metric. We include a statement for 
mathematical completeness specialized to arbitrary finite stopping times. 

Consider a stopping time r with respect to the right continuous filtration on 
C[0, cx)) that is continuous with respect to metric p. Denote by C[0, r] the metric 
space of paths in C[0, 00) such that t{u:) < 00 and w(i) = Wr for all t > t. Clearly 
C[0, r] is a closed subset of a Polish space, and is hence Polish itself under p. 

Theorem 9. Consider the same one- dimensional process as in Theorem^ (with 
Ki = A'2 = K) stopped at a continuous stopping time r. Let P denote the law of 
the stopped process. Then P satisfies QTCI with the constant C given by 

C = 4K2maxc2ne4^'("+4). 

n 

In particular, if we choose Cn — n~^/^ Gxp(— 2A'^(7^ + 4)) in (|32p . then one can take 

Proof. The proof is very similar to the proof of Theorem [S] Consider the coupling 
of processes X'^^ and X*^^' using the same driving Brownian motion, and consider 
the stopped processes X^^^^^ and X\f^^,^ . Here ri and T2 arc copies of r applied to 

paths of X(i) and X(2) respectively. Let A(t) = x[]^^^ - x[%^. 

Notice that the argument in the proof of Theorem [5] goes through for stopping 
times until inequality (|19p which gets now modified to 



(33) -E sup V'(A(s)) < a„(t) + K'^{t + 4) 

4 0<,s<t 

Here 

/•tATi 

a„(t) ^K^tE / i{sfds + 
Jo 
Thus, as before, by Gronwall's lemma we get 



/ EA{sfds. 
Jo 



t^ 

irt2 



(34) 



/•nAri 

^4(X(i),X(2)) = E sup Aisf < 4A.2ne4^'(«+4)s / ^{.s)^ds 

0<s<n Jo 
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Now, 

4(xw,x(2)) 



Ep^{X^^\X^'^'>) <m&xclE 



i + dl{xw,x(^)) 

■tATi 



Jo 

<CE f ' ^{sfds. 
Jq 

The rest of the proof is similar to the proof of Theorem [5j D 

3.3. Classes of lipschitz functions. In this section our objective is to work out a 
Hst natural examples of functions on the path space that are Lipschitz with respect 
to the uniform norm. Our aim is to show that paths of random processes derived 
from multidimensional diffusions lie in a cylinder around its "expected path" with 
exponentially high probability. Toward that aim, under suitable assumptions, we 
show concentration of processes of the type L TT{u)du where tt is an adapted process, 
and of adapted local martingales. 

We specialize to the case of T = 1 and 1-Lipschitz functions. Any other value of 
T or of Lipschitz constant can be reduced to this case by scaling space and time. 

Lemma 10. Suppose {/(i), < f < 1} &e a collection of functions f{t) : C"[0, 1] -^ 
M which are 1-Lipschitz with respect to d. That is, if lu and to' are elements in 
C"[0,1], then 

1 " 

\fit)iio)-f{t){u')f<-J2\H~^[C for every < i < 1. 

2=1 

Then the following functions are also 1-Lipschitz. 

(i) supj f{t) when the supremum is measurable. 

ill) For any Lipschitz function : R — > R, the composition (j)o f{t). In particu- 
lar, — f{t), \f{t)\, and \f{t) — a{t)\, where a{t) is any non-random function. 
(iii) The functions g(t) = J^ f(u)du, < i < 1. 

Further, suppose f{t) is not known to he a priori Lipschitz. Let uj = (wi, . . . , a;„), 
where each uji € C[0,1]. Let f^{t) denote the function f(t) as a function of tUi, 
while the rest of the coordinates are kept constant. Then, if for every choice of i 
and ujj, j 7^ i, the function f'^t) is Lipschitz in uii with coefficient 1/n, i.e., 

|rw(a..)-rw(^-)|<^ik-^:iioo, 

then fit) is 1-Lipschitz with respect to the d norm. 

Proof. The proofs of (i), (ii), and (iii) are obvious. To see the second part, choose a 
pair Wjw' in the product space. Construct a sequence of vectors ?7(1), . . . , rj{n + 1) 
in C" [0,1] such that 

w', if j<i-l, 






oj.,-, otherwise. 



Thus 7;(1) = cj and r]{n + 1) = uj' . 
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By the triangle inequality and the property of being separately Lipschitz in each 
coordinate, we get 

n 

|/(t)H - f{t){u:'f <nY^ \f{t){^{^ + 1)) - /(t)(r;(z))|^ 

1 " 

-E 



1=1 






This shows that /(f) is 1-Lipschitz with respect to the d norm. D 

We have the following corollaries. We focus on the set-up in Theorem|6l although 
please keep in mind that for the following Gaussian concentration bounds and L^ 
TCI would suffice. In particular they hold for diffusions satisfying the conditions 
in Corollary 4.1 in jTOl . 

Corollary 11 (Concentration of regular integrals). Consider the set-up in Theorem 
[51 Suppose that {/(t), < t < 1} is a measurable real-valued process on C"[0, 1] 
such that each f{t) is l-Lipschitz with respect to the metric d. Consider the process 
of integrals 

g{t){uj) = / f(u)(uj)du, and fi{t) = median of g{t), < t < 1. 
Jo 



Then, when (1221) holds, we have 



P 



sup \g{t) - fj.(t)\ - ^j. 

0<t<l 



> r] <2e 



-nr^/8cT^ 



for all r > la^/ln ^ log 2 where /i is the median o/supQ<j<2. 1.9(0 ~ m(OI under the 
measure P. 

The proof is straightforward application of Lemma [TU] conclusion (iii). The 
supremum is measurable since g(t) is always continuous. 

The last two corollaries are significant in mathematical finance where the value 
process of a portfolio is often expressed as a stochastic integral. In particular, it is 
a martingale under, what is known as, the risk-neutral measure. The final value 
of such a martingale is often determined externally (i.e., pay-off from an European 
derivative). These corollaries together with Theorem [7] can determine whether such 
portfolios can be impervious to random market risk. 

Corollary 12 (Concentration of martingales) . Consider the set-up in Theorem]^ 
(iii). Let {Nit)^ < t < 1} 6e a P-martingale such that Nq ~ and Ni is a 
Lipschitz function with respect to the metric d. Then the following concentration 
inequality holds 



(35) 



sup \N{t)\ -V 

0<t<l 



>r\ < 2e-"'''/*^'"' 



for all r > 2a ^2n ^log2, where v ~ E^ supQ<j<]^ 1-^(01- 

Proof. This is straightforward since A'^i is Lipschitz implies every other N(t), which 
are conditional expectations of iVi, must also be Lipschitz. D 
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3.4. Local times. Now we come to the discussion of a particularly important 
class of functions in the study of continuous stochastic processes: the local time. 
Consider a standard Brownian motion and let L{t) denote its local time at zero. 
Then, by definition (see 01 P- 227]) 

L(i)-liml / l{\B{s)\ < e}ds. 

This is clearly not a Lipschitz function of the paths of B{t). However, it is well- 
known that Brownian local time has Gaussian tails. In fact, a famous theorem of 
Levy states that L{t) has the same law as |-B(i)|. One way to prove this concentra- 
tion is by considering the Tanaka decomposition [301 P- 239]: 

\B{t)\= f sgn{B{s))dB{s)+L{t). 
Jo 

If P denotes the process /„ sgn{B{s))dB{s), then Levy (and later, Skorokhod) 
showed that L{t) = - info<s<t /3(s) A (see gOl P- 239]). Thus L{t) is a 1-Lipschitz 
function of the paths of (3. Since /? is another standard Brownian motion, it imparts 
Gaussian concentration to the local time function. 

This notion of obtaining local time as a Skorokhod map has been greatly gen- 
eralized. For the rest of text we will focus on such generalizations. We refer the 
reader to the articles by Dupuis and Ramanan |121 113] from which we borrow our 
description of the so-called Skorokhod Problem which we describe below. 

Consider a closed set G C R" and a set of unit vectors d{x) for each point x on 
the boundary of G (say dG). Let Z?"[0, oo) be the set of maps from [0, oo) to R" 
that are right continuous with left limits. For fj <E £'"[0,oo) let \fj\ (T) be the total 
variation of fj on [0,T] with respect to the Euclidean norm. 

Definition 1. Let i/j S I?"[0,(X)) with ipo G G he given. Then (0, fy) solves the 
Skorokhod Problem (SP) for ip with respect to G and d ii (j)o = "ipo, and if for all 

t g [0, oo) 

(i) cl>{t)=^{t)+fi{t); 
(ii) m e G; 
(iii) \fi\ (t) < oo; 

(iv) \fi\{t)=J^l{cl^{s)edG}d\f^\{s); 

(v) There exists measurable 7 : [0, 00) — )- M" such that 7(5) G d{(f>{s)), the set 
of direction vectors at the point (j){s) (|7y|-almost surely), and 

ry(i)= / ^is)d\r-i\{s). 
Jo 

When, the solution of SP exists (and is unique) for a large enough subset of the 
path space, the map that takes ip to cj) is called the Skorokhod map. The Skorokhod 
map, following an original idea due to Skorokhod, is used to construct stochastic 
processes that are constrained to remain within G by reflecting them inwards at 
the boundary dG in the direction given by the vector field d. 

A Skorokhod map is Lipschitz is there exists a positive constant K such that if 
(0,77) and ((/)', 77') are solutions to the SP for ip and ip' respectively, one has 

supMt)-v'm<Ksnp\m)-^'{t)\\, 

f36) *-" *~° 

sup \m)- 4'' m<K sup \m)-i;'it)\\. 

t>0 t>0 
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Here ||-|| is the regular Euclidean norm. The constant K is then called the Lipschitz 
constant of the Skorokhod map and -0 is called the driving noise. Note that the 
norm used in (j36|) is weaker than the d norm we have been using so far. 

Corollary 13. Suppose the Skorokhod map is 1-Lipschitz. Then, if the noise pro- 
cess Tp is chosen randomly with a law satisfying Theorem^ then constrained random 
path 4> and the local time function rj also satisfies QTCI with respect to the metric 



d{ui,ui') — sup 

0<t<T 



\ 



1 " 

-Y,{^^{t)-L0[{t)f 



i=l 



As a corollary we obtain that reflected Brownian motion (RBM) satisfies QTCI 
with respect to the above norm whenever the above Skorokhod map is Lipschitz. 

Proof. This is a corollary of the fact the QTCI is preserved under Lipschitz maps. 
See Lemma m The fact that d < d \s straightforward. 

To obtain RBM, one needs to take the driving noise as a typical path of multi- 
dimensional Brownian motion that satisfies QTCI by Theorem [S] □ 

Note that the above result for RBM is not useful since we do not know the QTCI 
constant (which will depend on n). This is a rather non-trivial job and something 
that we work out in detail for a specific example in the following Section. However, 
several natural conditions that guarantee when the Skorokhod map is Lipschitz can 
be found in [13], [TTl Thm. 2.2], [H Thm. 3.2] and [H Thm. 2.2]. We wih the 
following result from Dupuis and Ramanan |13] . 

Theorem 14 (Theorem 2.2 in |13j). Consider the Skorokhod problem on a poly- 
hedral domain G = {a; G K" : {x,rii) > Ci} for some vectors rji and scalars Ci, 
where i = l,2,...,n. Suppose the vector of direction of constraints (or, reflection) 
is constant on each face and are given by the vectors di, i ~ 1,2, ... ,71 which are 
linearly independent and satisfy [di^rji) = 1 for each i. Define the matrix 



(37) Q = [ftj] 



\^- {di,-ni)\, if i = j- 



If the spectral radius of Q satisfies a'{Q) < 1, then the Skorokhod map is Lipschitz. 

4. Applications to rank-based models 

Consider the model described in ^ in the Introduction. We call such a model as 
an n particle rank-based model. For finite n, with arbitrary initial values of Xi(0) 
and arbitrary drifts 6i, the existence and uniqueness in law of such an n particle 
model is guaranteed by a standard application of Girsanov's Theorem. Please see 
Lemma 6 in [37j . A part of that lemma is reproduced below. 

Let S = {5i,l < i < N) e M.^ and let /j, be an arbitrary probability distribution 
on M.^ . Consider the canonical sample space and filtration for the multidimensional 
Brownian motion described in Section [S] Let {Xi, . . . ,X„) denote the coordinate 
map. Let P^'^ denote the law of the n-particle rank-based model where the initial 
position is distributed as /i. Thus, P^'^^ is the Wiener measure starting from /i. 
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Lemma 15. For each t > the law P^'^ is absolutely continuous with respect to 
P*^'^ on Tt, with density 



( N f ^ \ 

exp^5,/3,(t)--5:<5f 



(40) dX(,)(t) - d/3,(t) + —{dL,^,+,{t) - dL,-,^j(t)) 



(38) 

where j3j can he defined by the expression 

(39) /3, {t)=Y. 1 { ^' ('^) = ^0) {s)]dX,{s), l<j<N. 

Under P •'^ the Pj 's are independent Brownian motions on M with drift coefficients 
Sj and diffusion coefficient 1. 

The other lemma we require considers the law of the ordered particle system 
under P^'^'■. 

^(l)(t)>^(2)(t)>...>X(„)(t). 

This ordered system, as shown in [2] and |37] , is a reflected Brownian motion in the 
cone {a; G M" : xi > X2 > ■ ■ ■ > Xn}- The following is in Lemma 4 of [37] j except 
that our ordering in ([T]) is the reverse of the notation used in [37] . 

Lemma 16. Let Xi^l < i < n be a solution of the SDE ([l}, defined on the 
canonical space, for some arbitrary initial condition and arbitrary drifts {5i\ . Then 
for each 1 < j < n the jth ordered process Xij\ is a continuous semimartingale 
with decomposition 

1 

V2^ 

where the /3j 's for 1 < j < n are independent Brownian motions (with respect to 
the given filtration) with unit variance coefficient and drift coefficients Sj , and are 
the same as appearing in ([M)) . 

Moreover, Lo,i = -^n.n+i = 0, and for \ < j < n ^ 1 

(41) L,-,+i(t)=liml / l((X(,)(s)-X(,+i)(s))/V2<e)d5, i > 0, 

which is half the continuous increasing local time process at of the semimartingale 
(X(j+i) — X(j))/v2- Moreover, the ordered system is a Brownian motion in the 
domain 

(42) {x e R" : xi>X2> ■■■> Xn} 

with constant drift vector {5j, 1 < j < N) and normal reflection at each of the n— 1 
boundary hyperplanes {xj^) = 2;(i_|_i)} for I < i < n — 1. 

4.1. Concentration of intersection local times. Our objective is to show that 
the vector of boundary local times {Lj,j^i, j = 1,2, . . . ,n — I) satisfy the QTCI. 
We start with a lemma. 

Lemma 17. Consider the rank-based model ([T]). Let X denote the center of mass 
process 

— 1 " 

xit) ^ -S^ xJt), t>o. 



1=1 



Then 
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(i) X{t) — ^(0) is a Brownian motion with mean 5 ~ "-^^X]"=i ^^ '^'^'^ diffu- 
sion coefficient 1/n and is independent of the vector of spacing s {Xu\{t) — 
X(,+i){t), l<i<n-l, t>0). 

(ii) Let P he an independent one- dimensional Brownian motion (/3o ~ 1) with a 
negative drift —1 which is reflected at the origin. Then the process defined 
by 

(43) y^(i)^X(,)(i)-X(t) + 4^, ^ = l,2,...,n, i > 0, 

y/n 

is an n-dimensional Brownian motion with constant drift and identity co- 
variance matrix which is normally reflected in the wedge G = C\^^iGi, where 

(44) G^ = {xi - x,+i > 0}, 1 < i < n - 1, and G„ = < ^ x; > I . 

Proof. The proof of the first assertions can be found in Lemma 7 in [37]. The 
argument for the second assertion uses Lemma [TBI Please see [37l p. 2188]. D 

Notice that the spacing vector between the y^'s and that between the X^'s are 
the same, i.e., 

Yi-Yi+i= X(i)- X(i+i), for aU i. 

Thus, the local time at zero for every F^ — y^+i by v^ is exactly Li_j+i/2, a fact that 
we use below. The advantage of considering li's is that now we can use Theorem 

M 

Theorem 18. For any choice of parameters ((5i, . . . , (5„), the vector of increasing 
random processes 

Ljj+i{t), j^l,2,...,n-l, 0<t<T, 

satisfies the QTCI with respect to the metric 



d{uj,uj') — sup 

0<t<T 



n 

-Y,{u:.{t)^u'S)f 

\ "»=1 



with a constant Cn^T where G is a universal constant. 

Remark. Let us point out to the reader that a better bound of n^T has been 
obtained in the follow-up article by Pal and Shkolnikov [38]. We strongly believe 
that this is optimal although we cannot prove it. 

Proof. Consider the Skorokhod map, described in Definition [l] on the polyhedral 
domain G given in (|44p . As in Theorem [Ml we can take the vectors 

e(i) — eii + 1) 
rj{i) — -j= , and d{i) ~ r]{i), for each i — 1, 2, . . . , n — 1. 

v2 

Here the e(i)'s represent the standard basis vector in R". For i = n, wc need to 
take r]{n) = n^^/^l, and d{n) ~ ri{n), where 1 is the vector of all ones. 

Then the d{i)'s are linearly independent and satisfies {d{i), ri{i)) = 1. Define the 
n X n matrix D by 

D = [d(l) I d{2) I ... I d{n)] . 
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Then D has full rank. The matrix Q in (|37p is then given 

ID, II I = ], I = n, or ] ~ n. 

Here D* represents the matrix transpose of D. 

It is easy to see that Q is a submatrix of a stochastic transition matrix. That is, 
consider P to be the transition probability matrix of a simple symmetric random 
walk on the integers {0, 1,2,..., n}, with absorbing boundary conditions. We then 
identify states and n by changing the P matrix to have P{n — 1,0) = 1/2 = 
1 — P(n — l,n) and P(n, 0) = 1 = 1 — P{n,n). Then Q is the submatrix of 
transition probabilities corresponding to states {1, 2, . . . , n}. Since these states are 
transient, it follows that a{Q) < 1. Thus, by Theorem [TH the Skorokhod map on 
this domain is Lipschitz. We need to estimate the Lipschitz coefficient. 

To do this, we need to find the set B in Assumption 2.1 (Set B) in [T2J p. 160]. 
We follow the notation used in [121 [13]. Let SP{d{i),ri{i),0) denote the Skorokhod 
problem in our domain G. Consider another Skorokhod problem SP(e(i), D*rj{i), 0), 
where D* refers to the transpose of D. As noted in [131 P- 203] (and easily veri- 
fiable), the matrix Q corresponding to this problem is the same as the Q in ()45|) . 
Thus the Skorokhod map is again Lipschitz. 

Let B be the set satisfying (Assumption 2.1 [El p. 160]) for the SP(e(i), D*i]{i),Q) 
for some 5 > Q. Then, as mentioned in [TS] p. 202], B = DB satisfies Assumption 
2.1 for the original problem SP((i(j),77(i), 0). 

We claim the following description of the set B. Consider a vector u such that 

(46) Qu < u, Ui > 0, i = l,2,...,n. 
Then, we claim that 

(47) i? = {.T G M" : \x,\ < wj , 6 = min {u, - {Qu),) . 

i 

Let us verify the conditions of Assumption 2.1 in [12j . Consider a boundary point 
z G dB. Then, without loss of generality (and considering only points in the relative 
interior), we can assume that Zi = Ui, for some i, and \zj\ < Uj for all j ^ i- In 
that case the inward normal to z is — e(i). 

For our choice of S, assume that, for some j, one has 

\{z,D*r,{j))\ = \{z,iI-Q)e{j))\<6, since 7?(j) = d(j) = De(j). 

Now, if j ~ i, then 

Zi+l + Zj-l 



\{z,{i-QHm 



>Ui = u., - [Qu)i > d, 



since every Uj is positive, \zj\ < Uj, and Qu < u. Thus, \{z, D*ri[j))\ < 5 implies 
j 7^ i, and hence {u,e{j)) = — (e(i),e(j)) = 0. This verifies [121 eqn. (2.2)] and 
establishes our claim. 

It is straightforward to see that the same 5 works for the set B = DB as well. 
To wit, take any point y S dB. Then, y = Dz, for some z G B. Thus {y,'r]{j)) = 
{z, D*r](j)) < S implies — (z^, e(j)) = 0, where v, as before, is the normal vector 
at z. But the normal vector at y, i/y, is obviously {D^^)*iy. Thus {iyy,d{j)) = 
{v,D-^d{j)) = {v,e{j)) = 0. This satisfies Assumption 2.1 in [12]. 

Thus to find the Lipschitz coefficient we need to find a vector u satisfying (|46|) . 
Let v{x) be any nonnegative strictly concave function on [0, 1] such that w(0) = 0. 
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Define u{x) = v{x/n) ior < x < n, and let Uk = v{k/n), for k = 1,2, . 
example, we choose v(x) = x{l — x). Then 

v"{x) = -2, 0<x<l. 

Then, by the strict concavity of v it follows that 



, n. For 



W/c > 2 i'^k-l +Ufe+l), 



1,2,.. 



1. 



1. For i ~ n the 



Hence, it follows that {Qu)i < Ui for every i = 1,2,..., 
inequality is not strict. However, this is easy to correct since the row Qn* is the 
zero row and we can choose w„ to be any small enough positive number. 
Now, for our choice of v, it follows that 

5 = min(ui - (Qu)^) > - inf -u" {x) = —^ inf -v"{x) = ^r. 

I 2 a<x<n 2n^ 0<a;<l n^ 

In fact, to simplify, we define 5 ~ n^- for the rest of the analysis. 

Now to find what the Lipschitz constant is for the Skorokhod problem on the 
wedge G wc use the Remark made in page 161 in [12]. If the set B constructed 
above satisfies Assumption 2.1 in [12] for some i5 > 0, then one plus the diameter 
of S^^B serves as the Lipschitz constant for the Skorokhod map. We already know 
what S is. Let us now estimate the diameter of the set B in (|T7l) . Since extreme 
points are preserved under linear transforms, we get 



diam(i3) < 2 max 



^ aiUid{i) 



where the maximum is running over all choices of coefficients ui = ±1. 

We claim that the above diameter is of the order ^Jn. This is easy to see since 
each d{i) has norm one and their inner products are given by the entries of the 
matrix D*D. Thus by expanding the vector X]i=i '^i'^id{i) for any choice of a we 
get 



y^ (TiUid{i) 



i=l 



4=1 



-. n— 1 



<E«? 



i+l 



< 



3E^ 



i—1 i—1 i—l 

We now try to bound. By our choice of the functions v and u wc get 



E"? = E 



V- I - 
n 



.j\M-xfdx^l^. 



In other words. 



diam(i3) < 2 max 



y^^(7iUid{i) 



i=l 



< 4^^. 



Combining with our estimate of S we get that the diameter of i5~^B is at most 
471^/^. This, as argued, previously, serves as our Lipschitz constant for the Sko- 
rokhod map on G as defined in (j^J). 
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Now consider the RBM Y{t) described in Lemma flTl The ith coordinate process 
Yi{t) has a semimartingale decomposition 

where Mi is a martingale and ^i (t) , 77^ are of finite variation, the former being the 
absohitely continuous part (the drift) and the latter being the component that is 
mutually singular with respect to the Lcbesgue measure (the local time). Compar- 
ing expressions (|1D|) with (|43p we get 

Viit) = —^ {Li,i+l{t) - Li_i,i(t)) + —^Lo{t), io,i = Ln.n+l = 0. 
y 2 \/n 

Here Lo{t) is the local time at zero for the Brownian motion /3(i). The drift is a 
constant Si — l/^/n, and the martingale Mi is a Brownian motion, independent of 
all the other M,'s. 

Thus in Definition [T] one can take the driving noise, ^j, to be a typical path of 
the n-dimensional Brownian motion with constant drifts (Mi + ^i, i = 1, 2, . . . , n). 
Then, according to Theorem[6l this multidimensional noise, during the time interval 
< i < T, satisfies QTCI with a constant Aw'^T. 

We will now use LemmaH) We take the metric space {E, ds) to be C"[0, T] with 
the d metric, and take fj, to be the multidimensional Wiener measure. We also take 
iF,dF) to be C"[0,T] with the metric 



d{u!,uj') = sup . 

0<t<T \ 



1 " 



n 



By ([5S)) . the map that takes i/j ^o fj is Lipschitz. From what we have done so far, 
it is now clear that the map that takes Mi + ^^ to 77 induces a Lipschitz map from 
{E,dE) to [F^dp) that is Lipschitz with a Lipschitz constant 0{n^'^). Since mul- 
tidimensional Brownian motions satisfy QTCI with a constant An~^T , this shows 
that the process fj satisfies QTCI with a constant 0{n^). 

Now consider the map that takes fj to the vector L. We claim that it is Lipschitz 
with a constant which is of order ^Jn. To see this, note that the map that takes L 
to fj is linear: 

1 - ^1 = ^ [e(l) - e(2) I e(2) - e(3) | . . . | e(n - 1) - e{n)] L 
V" V2 

We claim that the smallest eigenvalue of the (n — 1) x n dimensional matrix 
S=^ [e{l) - e(2) I e(2) - e(3) | . . . | e(n - 1) - e(n)] . 

is of the order 1/^/n. 

But it is easy compute compute inner products between columns of S. Thus, for 
any vector v £ M""^, we get 

n— 1 n~l n—1 _. n— 1 9 , 2 

\\Svf = Y.-f-T.-^-^-^^^^' 1y-/.,2,.,2 ^_-r 



i=2 i=l i=2 



The equality above is achieved when vi = V2 = . ■ ■ = Wji-i. Thus, under the 
constraint ^ivf = 1, the infimum of the above norm square is 1/n. This shows 
that the map that takes 77 to L is Lipschitz with a constant 0(1/ y^). 
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Thus the map that takes M + ^ to the vector of local times L induces a Lipschitz 
map with a Lipschitz constant 0{ti?). By Lemma H] it follows then that the law of 
the process (771, . . . ,77„_i) satisfies QTCI with a constant which is of the order of 
n^T. This completes the proof of the Proposition. D 

Finally we prove Theorem [TJ 

Proof of Theorem [H This proof is immediate once we note that the maximum func- 
tion is Lipschitz with respect to the d norm required in Theorem 1181 D 
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